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Abstract 

We present the LexGram system, an amalgam of (Lambek) categorial grammar 
and Head Driven Phrase Structure Grammar (HPSG), and show that the grammar 
formalism it implements is a well-structured and useful tool for actual grammar 
development. 

1 Introduction 

Grammar development becomes easier if the grammar formalism which has been chosen 
caters for the basic needs of syntactic modelling. According to the current state of 
affairs, grammars for natural languages differ in the following two points from grammars 
for formal languages. First, the lexicon contains complex syntactic information. Second, 
natural language admits nonlocal syntactic dependencies. 

Complex lexical information can be kept manageable if the grammar formalism in- 
cludes an inheritance mechanism. For a reliable and efficient handling of nonlocal de- 
pendencies, moved and empty constituents must be official components of the grammar 
formalism. The existence of a representational tool which allows for relating dislocated 
syntactic constituents systematically with their base positions, leads to more concise 
grammars, i.e. modelling effort is saved. On the processing side, unnecessary struggles 
with getting the grammar actually running will be avoided if the conflict in informa- 
tion flow between the top-down propagation of a moved constituent and the bottom-up 
manner of building a base-generated syntactic structure, e.g. filling a subcategorization 
list, is taken care of by the grammar interpreter. Head-Driven Phrase Grammars (HPSG 
[22]) seems to be an excellent candidate to fulfill the just mentioned requirements. They 
are based on typed feature terms with an inheritance mechanism and treat nonlocal de- 
pendencies by the Nonlocal Feature Principle (NFP). Unfortunately, on the processing 
side, due to the fact that HPSG is neither a pure (phrase structure) rule-based system 

*The research reported here has been funded by the Sonderforschungsbereich 340 "Sprachtheoretische 
Grundlagen fiir die ComputerHnguistik" , a project of the German National Science Foundation DFG. 



nor a pure lexicalist approach a genuine and efficient HPSG-interpreter seems to be still 
being searched for. The direct interpretation of an HPSG grammar by a successive re- 
finement of its phrase structure schemata as it has been suggested in [5] corresponds to 
a top-down parser, whose deficiencies are well-known since the early age of top-down 
interpreted Definite Clause Grammars: A phrase structure schema may be inserted (pos- 
sibly infinitly many times) although its applicability is restricted or even excluded by 
the input string (e.g. exploration of the adjunct scheme although no adjuncts occur in 
the input string or proposal of an infinite number of empty constituents by unrestricted 
application of the filler-head scheme.) The more sophisticated parsing methods which 
have been developed for grammars with a 'context-free' skeleton, i.e. with an informative 
phrase structure rule component, cannot reach their full efficiency since in HPSG phrase 
structure rules have been turned into phrase structure schemata by moving information 
to the lexicon. Most of the syntactic information is not located in the place where it is 
expected by these parsing algorithms. Parsers for lexicalized grammars, e.g. Lexicalized 
Tree Adjoining Grammars or categorial grammars, are not quite adequate either, since 
not all of the syntactic descriptions are anchored in the lexicon (i.e. filler-head structures 
and adjunct-head structures). This means that although HPSG is attractive for various 
reasons, a variant of HPSG is required for practical applications which is amenable to 
efficient processing in a straightforward manner. 

Since HPSG is closer to a lexicalized grammar than to a rule-based grammar \ it seems 
more natural to design a lexicalized version of HPSG than to compromise the essence of 
HPSG by transforming the phrase structure schemata back into phrase structure rules. It 
turns out that this is not just a matter of taste, since lexicalized grammars favor efficient 
processing (cf. [23], [24]) and enhance grammar design: 

• The grammar interpreter can work with a restricted view on the grammar specifica- 
tion: Only those syntactic descriptions have to be considered which can be accessed 
through the words in the input string. Hence, each step of the search for an anal- 
ysis is licensed by some input word. This helps to decrease the processing time in 
practice due to the reduced and controlled search space. In particular, the notori- 
ous problems of grammars which admit empty constituents are kept manageable, 
if every trace is licensed by a lexical filler category. 

• Furthermore, top-down and bottom-up approaches to parsing can be joined in ideal 
manner. Since there is no rule component in between a goal to be derived and the 
lexicon, each top-down driven step can access immediately the lexicon (without 
going through a 'link relation', cf. figure 1) and trigger the next bottom-up step. 

• Grammar modelling becomes simpler, since there is only one viewpoint on syntactic 
structure: from the lexemes. In non-lexicalized grammars, generalizations can only 
be carried out for the lexicon and the phrase structure component in separation, 
cf. the specification of adjunction in HPSG which is split into a phrase structure 
schema plus a lexical schema. In a lexicalized grammar, this collapses into one 
single, lexical schema. 



^For a comparison of HPSG and categorial grammars, see [21] 
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Figure 1: Architecture of phrase structure rule-based grammars vs. architecture of lexical 
grammars 

Now, one could argue for an automatic compilation of HSPG into Lexicalized Feature- 
based Tree Adjoining Grammars as proposed in [20]. However, by such a compilation 
only the processing virtues of lexicalized grammars are gained. In order to enjoy the 
conceptual elegance which arises from the uniform view on grammar in lexicalized ap- 
proaches, grammar should be encoded directly in a lexicalist manner. We want to propose 
a grammar formalism, called LexGram^ which is derived from HPSG by lexicalizing the 
phrase structure principles and schemata of HPSG. At this point, the reader might object 
that there is nothing new about the unification-based categorial grammar which results 
from this enterprise. However, existing unification-based categorial grammars do not in- 
corporate HPSG's principled way of treating nonlocal dependencies, since they take as 
their basis a basic categorial grammar [10], [26], [28], [2] resp. a Combinatory Categorial 
Grammar [27]. On the other hand, those categorial grammars which come with a log- 
ically well-defined treatment of moved constituents, the Lambek categorial grammars^, 
are usually not furnished with a feature term component. Basically, the LexGram sys- 
tem realizes a linguistically motivated extension of the Lambek calculus^. LexGram has 
been built on top of the CUF system [3], which takes care of the handling of (recursively 
defined) typed feature terms. 

^The LexGram system, a small grammar for German, and their documentation [13] is available via 
anonymous ftp. 

^Lambek categorial grammars take the Lambek calculus [15] as their rule system. For linguistic 
applications and extensions see e.g. [16] and [17]. 

"'The formal semantics of feature-based Lambek categorial grammars is investigated in [4]. 



2 From HPSG to categorial grammar 



2.1 Reduction to one single phrase structure schema 

Lexicalizing HPSG means to turn its phrase structure component ultimately schematic, 
by admitting only one single phrase structure schema. More specific postulations on the 
information flow in a grammar, which are not covered by that single schema, have to 
be expressed in the lexical signs. I.e. the phrase structure component is impoverished 
whereas the lexical schemata or word class descriptions are enriched. 

The phrase structure component of HPSG consists essentially of the four phrase struc- 
ture schemata (immediate dominance schemata) head-complement-, head-marker-, head- 
adjimct, head-filler structures, and of a number of principles. The major phrase structure 
principles are the Head Feature Principle^ 
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The schema for head-complement structures is the least restrictive one and closely 
mirrors the categorial grammar idea of a functor category 'subcategorizing' for a certain 
number of arguments. Hence, the head-complement schema will be the only one to survive 
this lexicalization enterprise. Specifiers, adjuncts, and fillers will have to become functor 
categories, i.e. 'heads' in a generalized sense of the word. We get rid of the head-marker 
schema by assuming an analysis of complementizers and determiners as functional heads, 



^We simply write the local and nonlocal feature structures as a pair, omitting the features. Feature 
structures are written in CUF notation, using the Prolog conventions for variables and lists. 



cf. [7], [19]. Instead of using a special mod- feature with a special treatment (bypassing 
the phrase structure principles) to express the fact that an adjunct 'subcategorizes' for 
the phrase it modifies, 
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allows us to remove the head-adjunct schema. A treatment of fillers as heads has been 
proposed by [11], [6] as a transliteration of type-raised categories X/(Y/Z) (cf. [8], [1]) into 
HPSG-sign format (assuming the subcat value to be a list of signs): 
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This schema of a lexical sign means that the derivation of the complement Y should 
include a trace Z as one of its leaves. The trace Z has to be mentioned twice in order to 
get the desired effect in interaction with the Nonlocal Feature Principle (3). One would 
wish to state a more concise lexical schema for fillers, i.e. 
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To accomodate schema (7), the slash-feature has to be admitted for synsem-structures 
which occur on the subcat list, and the Nonlocal Feature Principle must be changed in 
order to retrieve the slash information from the complement's synsem value. Since no 
confusion can arise any longer, the feature inherited_slash can be omitted from the 
nonlocal structure: 
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Merging the Head Feature Principle (1), the Subcategorization Principle (2) and the 
revised Nonlocal Feature Principle (8), we get the schema in (9), which is the only 
ingredient of the phrase structure component of the lexicalized grammar. 
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2.2 Relating the single schema to a categorial grammar 

In order to get access to the wealth of research concerning the logical foundations of 
categorial grammars, we will argue that the kernel of HPSG and a variant of the Lambek 
calculus are equivalent. 

The head/subcat structures of HPSG are an uncurried notation for syntactic cate- 
gories in a categorial grammar. The translation scheme between both representations is 
sketched by example 1, ignoring surface word order issues (as we will do in the remain- 
der of this section). Essentially, /-operators at embedding level 1 mark elements of the 
subcat list, whereas /-operators at embedding level 2 correspond to the to_bind_slash 
of HPSG. 



Example 1 (Synsem's vs. categories) 
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It is common knowledge that the Subcategorization Principle (2) reflects the rule of 
functional application X/Y, Y X in categorial grammar. But what about the Non- 
local Feature Principle? Subsequently, we will show that this principle mimics the hypo- 
thetical reasoning mechanism of the Lambek calculus. To get the direct correspondence. 



we define a version of the phrase structure schema in (9) which is ressource-conscious 
with respect to the nonlocal information (using disjoint union l±) of multi-sets instead of 
set union). For later convenience, the string information phonology is added, and the, 
possibly dynamically generated, 'lexical' entry for the empty string is shown. 

Definition 1 (Phrase Structure Schema + Slash Termination Schema) 

(Leftward) Phrase Structure Schema: 
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The rightward Phrase Structure Schema is analogous with complement and head daughter 
interchanged. 

Slash Termination Schema: 
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Actually, there is a close correspondence between signs and sequents (of a Natural De- 
duction calculus in sequent format). 

Example 2 (Signs vs. sequents) 

Phon 

44> Phon Synsem 

Synsem 

However, the nonlocal value does not fit into that picture. If we allow the phon value to 
be a list of words and/or synsem's. Slash Termination can be incorporated into the Phrase 
Structure Schema. I.e. traces, which are postulated by a to_bind_slash are posited 
immediately in the complement's phon value instead of positing them on demand. Then 
the bookkeeping with respect to the inherited_slash can be omitted. The equivalence 
of definition 1 and definition 2 can be shown by inductions on the size of derivations. 



Definition 2 (non-threading Phrase Structure Schema) 

(Leftward) Phrase Structure Schema: 
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The non-threading Phrase Structure Schema of definition 2 condenses an application 
of the rule (/Elim) plus n applications (n length of ^i) of the rule (/Intro) of a version 
of the Lambek calculus: 

Definition 3 (a semi-directional Lambek calculus) 

(ax) X X 
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3 The LexGram formalism 

In this section, the actual implementation of the LexGram formalism will be described. 

3.1 Data type for lexicalized syntactic structures 

In order to gain efficiency, we want the subcategorization information to reflect directly 
the expected surface order of the complements. In HPSG, the word order information 
is defined separately from the lexical subcategorization information as 'constituent order 
principles'. However, if word order is a part of the subcategorization information, the 
work of the grammar interpreter becomes much simpler because it is sufficient to use 
string concatenation instead of expensive permutation operations. In order to avoid 
confusion with the slightly difi'erent semantics of HPSG feature names, we decided to 
rename head into root, and subcat into leaves. The synsem type is replaced by the 
stree type in figure 2. Each element on the leaves list is annotated with its direction 
wrt. the head of the phrase. The leaves list induces a binary tree, where elements at the 
beginning of the list are closer to the head than those towards the end of the list. The 
set of trees^ which correspond to uses of the stree type in the lexicon is described more 
formally in definition 4. 



^The view that a LexGram grammar is a lexicon which maps strings on trees is reminiscent of 
Lexicalized Tree Adjoining Grammars (LTAG's) [23]. However, the two formalisms cannot be strongly 
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Figure 2: Data type for syntactic information 



Definition 4 (Lexical tree description) 

1. A lexical tree description has exactly one terminal leaf, the head leaf. 

2. It is a binary tree (except for the nontrivial subtree whose root is a preterminal 
node.) 

3. All the inner nodes of the tree belong to the path between the root and the head leaf. 

Example 3 (Lexical tree description) 
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Note that open leaf nodes, i.e. expected complements, are tree descriptions themselves 
(of type arg_stree). This admits complements with a non-empty leaves list in order to 
encode control phenomena, or complements with a non-empty slash value for treating 
movement phenomena, as discussed in the section 2. 



equivalent. On the one hand, LexGram trees are restricted to be binary trees (or flattened trees derived 
from binary trees). On the other hand, concerning the handhng of nonlocal dependencies, it seems that 
the adjunction operation of TAG can only model one filler-trace relation per local tree. 
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Figure 3: Graphical sketch of a head-driven parser {L lexicon) 
3.2 Grammar interpreter 

The task of the grammar interpreter (i.e. the parser or generator) is to construct a 
complete syntax tree for a given string or input semantics from the partial trees specified 
in the grammar. Subsequently, only the parsing algorithm of the LexGram system will 
be presented. A generator can be spelled out along the same lines cf. Konig 1994 [14] 
and has been implemented, as well. 

The parsing algorithm is an adaptation and refinement of the head-driven parser 
for Lambek categorial grammars (cf. [12], [9]). The basic mechanism, without traces, is 
sketched in figure 3 relying on the (imperfect) graphical representation of the stree type. 
To be more precise, we will give a sequent-style characterization of the full algorithm. In 
contrast to the Phrase Structure Schema of definition 1, the direction information that 
comes with a complement is honored by the parser. However, there is no directionality 
attached to the elements of the slash value. The list order of the slash is disregarded, 
i.e. the slash values are treated as multi-sets. The resulting system is in between a 
directional and a nondirectional Lambek calculus with the directional operators \ and / 
at odd levels of embedding and an undirectional operator | at even levels of embedding^, 
e.g. (Xo/Xi)/((X2|X3)|X4). 

For efficiency reasons, sequences are represented as pairs of string positions^. Traces 
are posited on demand, i.e. an explicit slash threading mechanism is required. Since 
the slash value is considered as an unordered (multi-)set, an encoding by pairs of list 
positions is not available, only a difference-list representation Slashln-SlashOut. The 
rule (lex) in figure 4 chooses a potential head from the string^. A trace which could 
serve as the head of the current phrase is taken from the incoming slash value by the 
rule (trace), which guesses the insertion point of the trace in the string. The axiom 

^This leads to asymmetries in the rule system (no introduction rules for \ and /, no elimination rules 
for I), but this simple-minded design proved to be useful enough for writing a small, but non-trivial 
grammar for German. 

^This was suggested by Jochen Dorre and implemented by Peter Krause. Interestingly, Morrill [18] 
introduced recently the representation of sequences as pairs of indices in a more general manner for 
Lambek deduction and related it to the formal semantics of Lambek systems. 

^Sequent rules are usually read in a backward chaining manner, i.e. starting from the conclusion. 
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Figure 5: Head-driven parser: Reduction of subcategorization list 



scheme (axiom) in figure 5 is the base of the recursion on the subcategorization list of 
the head in the rule (PSS/) (resp. (PSS\)). Each recursion step triggers a new subproof 
which will be fed into the 'choice of head' rules. The slash values have to be represented 
as nested lists in order to control whether all the elements of Si are realized as traces in 
the derivation of the complement. A sample run of the head-driven parser is sketched in 
figure 6. 

Empty constituents are handled safely because there are only traces available which 
have been licensed by words in the input string. In the current implementation of the 
parser, which caches the lexical lookup, parsing times are in the range of seconds. For 
example, it takes approx. two seconds to parse a German sentence with a dozen words 
which includes the following syntactic phenomena: movement of the main verb to verb 
first position, movement of a complement into the vorfeld, two adjectives, and a relative 
clause. Simultaneously to all these syntactic manipulations, a Discourse Representation 
Structure is built up as a feature structure. 
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Figure 6: Sample run of the head-driven parser 

4 Encoding a grammar 

A lexicalist grammar formalism with a built-in representation of movement supports 
an object-centered view on grammatical descriptions: The description of the syntactic 
behavior of a word is composed by inheriting information along a word class hierarchy. If 
a constituent may be moved, an instance of a movement schema is added as one possible 
lexical characterization of that constituent (or its lexical head) . 

4.1 The word class hierarchy 

Since all syntactic descriptions are located in the lexicon, the whole grammar can be seen 
as a single taxonomy of word classes whose leaves are the lexical entries, e.g. figure 7. 
For example, the most general syntactic structure is syntax_tree which helps to abstract 
from the realization of the stree data type in terms of features. syntax_tree is encoded 
as a two-place CUF sort: 

/ „ , \ f root : Root k\ 

syntax_tree Root, Leaves := (10) 
V / y leaves : Leaves J 

A subclass-superclass relation is represented as a CUF sort (the subclass) calling its 
superclass as a subgoal, i.e. subclass := superclass. For example, the definition of 
maxpro j constrains the list of leaves to be empty: 

maxpro j [ Root ):= syntax_tree[ Root, j (11) 
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Figure 7: Part of the word class hierarchy 



CUF sorts are the appropriate representation^" because the 'expansion' of a lexical en- 
try corresponds to a bottom-up traversal of the word class hierarchy^^, which is easily 
simulated by the backward chaining mechanism of proving a CUF goal. 

4.2 Movement 

Since traces are official components of the proposed grammar formalism, concise descrip- 
tions of phrase structures which involve nonlocal dependencies are feasible. Similar to 
the type-raising rules in categorial grammar, the movement of a phrase means that its 
base representation is the (single) element of the slash value of the (first) complement, 



The ability of determiner phrases to move to the vorf eld will be expressed by adding 
this alternative to the basic representation of dp's: 

Actually, CUF sorts are too expressive since no recursion is required to describe the word class 
hierarchy. A unification-based macro or template mechanism would be enough, cf. [25]. But for the 
grammar as a whole, the full expressivity of CUF sorts is needed as a declarative counterpart of what 
used to be called 'procedural attachments', e.g. to construct semantic representations. 

^^Note that in order to implement an HPSG faithfully, two different realizations of inheritance are 
required: First, inheritance which is carried out by a top-down traversal of the taxonomy of phrase 
structure principles and schemata in order to simulate the parsing process, i.e. a ramification in a syntax 
tree is obtained by speciaHzing a phrase structure schema. Second, inheritance which is reahzed by a 
bottom-up traversal of a word class hierarchy in order to profit from lexical generalizations. 
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/ cp_specifier & \ 
y movement_schema^ Base j j ' 
basic_dp := maxpro j dp ^. ^i^'j 
deteriiiiner_phrase := basic_dp. 
determiner_phrase := vorfeld^ basic_dp 

Compare (13) to the covert trace analysis of Pollard and Sag [22, chapter 9] where overt 
traces are avoided by adding alternative, shorter subcategorization frames to the word 
class (here the verb) whose complements can be moved. (The trace still appears on the 
slash value.) Computationally, this is more costly than the solution in (13) because e.g. 
three alternatives will be added to the lexicon entry of a three-place verb, which have 
to be explored during parsing whereas (13) adds only one alternative to a lexical entry. 
Furthermore, conceptually, the covert trace solution seems to be less clear: It is the dp 
that may move from its base position, why should the verb know about it (unless the 
verb puts constraints on the movement of its complement) . 



5 Conclusion 

Abstract specifications in the style of HPSG have become an ideal in some groups of the 
computational linguistic community. However, it seems that most people who implement 
a grammar 'in the spirit of HPSG' have to compromise in order to get efficiency. In this 
paper, we explained one possible compromise: First, turn the phrase structure schemata 
and principles ultimately schematic by admitting only the head-complement schema. 
Second, hard-wire these principles into an efficient grammar interpreter, i.e. parser or 
generator. 

The grammar writer is supported because the grammar interpreter incorporates a 
certain amount of linguistic knowledge, i.e. a general phrase structure schema including 
a built-in treatment of non-local dependencies. The lexicalist approach provides for 
a uniform view of grammar as a word class hierarchy, a clean basis for code-sharing 
among lexical entries. Additional benefits come from the underlying typed feature-based 
language. Since the signature (the type system) has to be defined before the rules, i.e. 
the CUF sorts, are spelled out, the writing of more transparent grammars is encouraged 
and errors are discovered much more often at compile time. Further, CUF sorts can serve 
as interfaces among the grammar modules. 

So far, no specific tools for large scale grammar development and debugging are avail- 
able in CUF/LexGram. A module concept is necessary to structure the grammar properly 
into submodules and to guarantee for data encapsulation. The tools for inspecting the 
data-fiow in a grammar should be still improved. At compile time, a graphical represen- 
tation of the word class hierarchy etc. could be extracted automatically. At run time, the 
user needs (more) means to visualize and to control the co-routining of subgoals. 



References 



Anthony E. Ades and Mark J. Steedman. On the order of words. Linguistics and Philos- 
ophy, 4:517-558, 1982. 

Gosse Bouma and Gertjan van Noord. Contraint-based categorial grammar. In Proceedings 
of the International Conference on Computational Linguistics, Kyoto, Japan, 1994. 

Jochen Dorre and Michael Dorna. CUF - a formalism for linguistic knowledge representa- 
tion. Deliverable R.1.2A, DYANA 2, August 1993. 

Jochen Dorre, Dov M. Gabbay, and Esther Konig. Fibred semantics for feature-based 
grammar logic. Journal of Logic, Language and Information. Special Issue on Language 
and Proof Theory, to appear. 

Martin Emele and Remi Zajac. Typed unification grammars. In Proceedings of 13th 
International Conference on Computational Linguistics, Helsinki, Finland, 1990. 

Anette Frank. Verb second by underspecification. In Harald Trost, editor. Proceedings of 
KONVENS'94, pages 121-130, Berlin, 1994. Springer- Verlag. 

Hubert Haider. Deutsche Syntax - generativ. Vorstudien zur Theorie einer projekiven 
Grammatik. Narr, Tiibingen, 1993. 

Mark Hepple. The Grammar and Processing of Order and Dependency: a Categorial 
Approach. PhD thesis, University of Edinburgh, Edinburgh, Scotland, 1990. 

Mark Hepple. Normal form theorem proving for the Lambek calculus. In Hans Karlgren, 
editor, Proceedings of the 13th International Conference on Computational Linguistics, 
pages 173-178, Helsinki, 1990. Helsinki University. 

Lauri Karttunen. Radical lexicalism. In Mark Baltin and Anthony Kroch, editors. Alter- 
native Conceptions of Phrase Structure, pages 43-65. University of Chicago Press, 1989. 

Tibor Kiss and Birgit Wesche. Verb order and head movement. In Otthein Herzog and 
Claus-Rainer RoUinger, editors. Text Understanding in LILOG, pages 216-240. Springer, 
Berlin, 1991. 

Esther Konig. Parsing as natural deduction. In Proceedings of the 27th Annual Meeting of 
the Association for Computational Linguistics, pages 272-279, Vancouver, B.C., 1989. 

Esther Konig. A study in grammar design. Arbeitspapier des Sonderforschungsbereich 
340 54, Institut fiir Maschinelle Sprachverarbeitung, Universitat Stuttgart, 1994. 

Esther Konig. Syntactic-head-driven generation. In Proceedings of the International Con- 
ference on Computational Linguistics, Kyoto, Japan, 1994. 

Joachim Lambek. The mathematics of sentence structure. American Mathematical 
Monthly, 65:154-170, 1958. Reprinted in: Buszkowski, W., W. Marciszewski, and J. van 
Benthem (eds): Categorial Grammar. John Benjamins, Amsterdam 1988. 

Michael Moortgat and Richard Oehrle. Adjacency, dependency and order. In Dekker and 
Stokhof, editors. Proceedings of the 9th Amsterdam Colloquium, 1993. 



[17] Glyn Morrill. Type Logical Grammar: Categorial Logic of Signs. Kluwer Academic Pub- 
lishers, Dordrecht, 1994. 

[18] Glyn Morrill. Higher-order linear logic programming of categorial deduction. In Proceedings 
7th Meeting of Association for Computational Linguistics, European Chapter, Dublin, 1995. 

[19] Klaus Netter. Towards a theory of functional heads: German nominal phrases. In John 
Nerbonne, Klaus Netter, and Carl Pollard, editors, German in Head Driven Phrase Struc- 
ture Grammar, 1994. 

[20] Klaus Netter, Robert Kasper, Bernd Kiefer, and K. Vijay-Shanker. HPSG and TAG. In 
3e Collogue International sur les Grammaires d'Arbres Adjoint (TAG+3), pages 77-82, 
Paris, 1994. 

[21] Carl Pollard. Categorial grammar and phrase structure grammar: An excursion on the 
syntax-semantics frontier. In Emmon Bach, Richard Oehrle, and Deirdre Wheeler, ed- 
itors, Categorial Grammars and Natural Languages Structures. Reidel, Dordrecht, 1988. 
Proceedings of the Conference on Categorial Grammar, Tucson 1985. 

[22] Carl Pollard and Ivan A. Sag. Head Driven Phrase Structure Grammar. University of 
Chicago Press, Chicago, 1994. 

[23] Yves Schabes, Anne Abeille, and Aravind Joshi. Parsing strategies for lexicalized gram- 
mars: Application to Tree Adjoining Grammars. In Proceedings of the 12th International 
Conference on Computational Linguistics, pages 578-583, 1988. 

[24] Yves Schabes and Aravind Joshi. The relevance of lexicalization to parsing. In Proceed- 
ings of the International Workshop on Parsing Technologies, pages 339-349, Pittsburgh, 
Pennsylvania, 1989. Carnegie Mellon University. 

[25] Stuart M. Shieber. An Introduction to Unifi,cation-Based Approaches to Grammar. Lecture 
Notes. Center for the Study of Language and Information, Stanford, Ca., 1986. 

[26] Hans Uszkoreit. Categorial Unification Grammars. In Proceedings of the 11th International 
Conference on Computational Linguistics, pages 187-194, Bonn, 1986. 

[27] Kent B. Wittenburg. Natural Language Parsing with Combinatory Categorial Grammar in 
a Graph-unification-based Formalism. PhD thesis. University of Texas, Austin, 1986. 

[28] Henk Zeevat, Ewan Klein, and Jo Calder. Unification categorial grammar. Technical 
Report EUCCS/RP-21, Centre for Cognitive Science, University of Edinburgh, Edinburgh, 
Scotland, 1986. 



